Electronic device and method of controlling operation of vehicle

ABSTRACT

An electronic device and method of controlling an operation of a vehicle are provided. The method includes obtaining a video sequence including a plurality of frames from a camera installed on the vehicle; detecting, from the plurality of frames, an object included in the plurality of frames; obtaining position information regarding the object with respect to each of the plurality of frames; determining whether an event related to driving of the vehicle has occurred by analyzing time-series changes in positions of the object in the plurality of frames; generating a notification message about the event based on a result of the determining; and outputting the generated notification message, wherein the detecting of the object, the obtaining of the position information, the determining of whether an event has occurred, and the generating of the notification message are performed using a plurality of learning models.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is based on and claims priority under 35 U.S.C. § 119to U.S. Provisional Patent Application No. 62/506,732, filed on May 16,2017, in the U.S. Patent and Trademark Office, and Korean PatentApplication No. 10-2017-0097814, filed on Aug. 1, 2017, in the KoreanIntellectual Property Office, the disclosures of which are incorporatedby reference herein in their entireties.

BACKGROUND 1. Field

The disclosure relates to an electronic device and method of controllingan operation of a vehicle, and for example, to an electronic device anda method of providing a notification message notifying about anoccurrence of an event related to driving of a vehicle according toposition of an object in a plurality of frames.

2. Description of Related Art

Along with developments in multimedia technology and network technology,a user may receive various services using electronic devices. Along withdevelopments in technologies applied to a vehicle, various methods ofrecognizing whether an event related to driving of a vehicle hasoccurred are being developed.

On the other hand, in autonomous driving technology, which requires aquick and accurate cognitive function, demand for a technology fordetermining whether an event related to driving of a vehicle hasoccurred more accurately with a limited amount of data and notifying auser via a notification message is increasing.

SUMMARY

An electronic device and method of providing a notification messageabout an occurrence of an event related to driving of a vehicle based onposition of an object in a plurality of frames using a plurality oflearning models are provided.

In accordance with an aspect of the disclosure, an electronic device forcontrolling an operation of a vehicle is provided, the electronic deviceincluding a memory configured to store at least one program; and atleast one processor configured to provide a notification message byexecuting the at least one program, wherein the at least one programincludes instructions, which when executed by the processor, cause theelectronic device to perform operations comprising: obtaining a videosequence including a plurality of frames from a camera installed on thevehicle; detecting, from the plurality of frames, an object included inthe plurality of frames; obtaining position information regarding theobject with respect to each of the plurality of frames; determiningwhether an event related to driving of the vehicle has occurred byanalyzing time-series changes in positions of the object in theplurality of frames; generating a notification message about the eventbased on a result of the determining; and outputting the generatednotification message, wherein the detecting of the object, the obtainingof the position information, the determining of whether an event hasoccurred, and the generating of the notification message are performedusing a plurality of learning models.

In accordance with another aspect of the disclosure, a method ofcontrolling an operation of a vehicle is provided, the method includingobtaining a video sequence including a plurality of frames from a camerainstalled on the vehicle; detecting, from the plurality of frames, anobject included in the plurality of frames; obtaining positioninformation regarding the object with respect to each of the pluralityof frames; determining whether an event related to driving of thevehicle has occurred by analyzing time-series changes in positions ofthe object in the plurality of frames; generating a notification messageabout the event based on a result of the determining; and outputting thegenerated notification message, wherein the detecting of the object, theobtaining of the position information, the determining of whether anevent has occurred, and the generating of the notification message areperformed using a plurality of learning models.

In accordance with another aspect of the disclosure, there is provided anon-transitory computer readable recording medium having recordedthereon a computer program for implementing the above method.

BRIEF DESCRIPTION OF THE DRAWINGS

The above and other aspects, features, and advantages of certainembodiments of the present disclosure will be more apparent from thefollowing detailed description, taken in conjunction with theaccompanying drawings, in which:

FIG. 1 is a diagram illustrating an example in which an electronicdevice according to an embodiment controls operation of a vehicle;

FIG. 2 is a flowchart illustrating a method by which an electronicdevice provides a notification message to a user, according to anembodiment;

FIG. 3 is a flowchart illustrating a method by which an electronicdevice provides a notification message to a user by using a learningmodel, according to an embodiment;

FIG. 4 is a diagram illustrating an example of generating a notificationmessage using a learning model, according to an embodiment;

FIG. 5 is a table illustrating how contents of a notification messageare determined, according to an embodiment;

FIG. 6 is a diagram illustrating an example of outputting a notificationmessage according to an embodiment;

FIG. 7 is a diagram illustrating an example of outputting a notificationmessage according to an embodiment;

FIGS. 8 and 9 are block diagrams illustrating an electronic deviceaccording to some embodiments;

FIG. 10 is a block diagram illustrating a processor according to anembodiment;

FIG. 11 is a block diagram illustrating a data learner according to anembodiment;

FIG. 12 is a block diagram illustrating a data recognizer according toan embodiment; and

FIG. 13 is a diagram illustrating an example of learning and recognizingdata as an electronic device operates in conjunction with a server,according to an embodiment.

DETAILED DESCRIPTION

Reference will now be made in greater detail to various exampleembodiments, examples of which are illustrated in the accompanyingdrawings, wherein like reference numerals refer to like elementsthroughout. In this regard, the present embodiments may have differentforms and should not be understood as being limited to the descriptionsset forth herein. Therefore, the embodiments are merely described below,by referring to the figures, to explain aspects. As used herein, theterm “and/or” includes any and all combinations of one or more of theassociated listed items. Expressions such as “at least one of,” whenpreceding a list of elements, modify the entire list of elements and donot modify the individual elements of the list.

Hereinafter, various example embodiments will be described more fullywith reference to the accompanying drawings. The embodiments may,however, be embodied in many different forms and should not beunderstood as being limited to the embodiments set forth herein. Rather,the embodiments are provided so that this disclosure will be thoroughand complete, and will fully convey the scope of the disclosure to thoseof skill in the art. In drawings, certain elements may be omitted forclarity, and like elements denote like reference numerals throughout thedisclosure.

Throughout the disclosure, it will be understood that when a portion isreferred to as being “connected to” another portion, it can be “directlyconnected to” the other portion or “electrically connected to” the otherportion via another element. Furthermore, it will be further understoodthat the terms “comprises” and/or “comprising” used herein specify thepresence of stated features or components, but do not preclude thepresence or addition of one or more other features or components.

Hereinafter, embodiments will be described in greater detail withreference to accompanying drawings.

FIG. 1 is a diagram illustrating an example in which an electronicdevice 1000 according to an embodiment controls operation of a vehicle.

Referring to FIG. 1, the electronic device 1000 may be a deviceinstalled on a vehicle, and the electronic device 1000 may receive avideo sequence from a camera installed on the vehicle and transmitnotification messages indicating occurrences of various events to auser.

Although it is described above that the electronic device 1000 receivesa video sequence from a camera installed on the vehicle, the presentdisclosure is not limited thereto, and the electronic device 1000 mayreceive a video sequence from a camera capable of photographing aperiphery of the vehicle. The periphery of the vehicle may include, forexample, and without limitation, areas to the front, sides, and rear ofthe vehicle, or the like.

According to an embodiment, the electronic device 1000 may provide anotification message including different contents depending on types ofevents and risk level of driving. For example, when it is determinedthat it is desirable to provide an action guideline corresponding to anevent instead of simply notifying the event based on the type of theevent and a risk level of driving, a notification message includinginformation regarding the event and an action guideline corresponding tothe event may be provided to a user. For example, an action guidelinecorresponding to an event may include a method of reducing a risk levelof driving.

According to an embodiment, when the electronic device 1000 determinesthat it is desirable to control an operation of a module installed on avehicle based on a type of an event and a risk level of driving, theelectronic device 1000 may transmit a command for controlling anoperation of the module installed on the vehicle to the module installedon the vehicle. For example, the electronic device 1000 may output anotification message and control an operation of the module installed onthe vehicle simultaneously, based on a type of an event and a risk levelof driving. When a user input for controlling an operation of the moduleinstalled on the vehicle is not received within a certain time periodafter the electronic device 1000 output a notification message, theelectronic device 1000 may transmit a command for controlling anoperation of the module installed on the vehicle to the module installedon the vehicle. The electronic device 1000 may control an operation ofthe module installed on the vehicle without outputting a notificationmessage, based on a type of an event and a risk level of driving.

The electronic device 1000 may be, for example, and without limitation,one or more of a head unit in a vehicle, an embedded board, a smartphone, a tablet PC, a PC, a smart TV, a mobile phone, a personal digitalassistant (PDA), a laptop computer, a vehicle, a media player, a microserver, a global positioning system (GPS) device, an electronic bookterminal, a digital broadcast terminal, a navigation device, a kiosk, anMP3 player, a digital camera, a consumer electronic device, and othermobile or non-mobile computing devices, or the like, but is not limitedthereto. Furthermore, the electronic device 1000 may be a wearableelectronic device, such as, for example, and without limitation, awatch, an eyeglass, a hair band, and a ring, having a communicationfunction and a data processing function, or the like, but is not limitedthereto. The electronic device 1000 may include any types of devicescapable of obtaining an image (e.g., a video and a still image) from acamera and providing a notification message to a user based on theobtained image.

According to an embodiment, the electronic device 1000 may, for example,include a module installed on a vehicle, may control operations of thevehicle, and may communicate with other modules installed on the vehiclevia a certain network.

According to an embodiment, the electronic device 1000 may, for example,include a device separate from a vehicle, such as a smart phone, or thelike, but is not limited thereto. In this case, the electronic device1000 may obtain a video sequence using a camera of the electronic device1000 or may receive a video sequence from a camera capable ofphotographing the periphery of the vehicle via a certain network.Furthermore, when the electronic device 1000 is a device separate fromthe vehicle, the electronic device 1000 may communicate with a moduleinstalled on the vehicle to control operations of the vehicle.

The vehicle may, for example, and without limitation, be a means oftransportation having communication function, data processing function,and transportation function, e.g., a car, a bus, a truck, a learn, abicycle, a motorcycle, or the like, but is not limited thereto.

Furthermore, the electronic device 1000 may communicate with a server2000 or another electronic device (not shown) via a certain network, to,for example, and without limitation, receive a video sequence, totransmit a notification message, and to transmit a command forcontrolling an operation of the other electronic device, or the like. Inthis case, the certain network may, for example, include a general datacommunication network that allows network components to communicatesmoothly with one another and includes a local area network (LAN), awide area network (WAN), a value added network (VAN), a mobile radiocommunication network, a satellite communication network, and mutualcombinations thereof and may include a wired Internet, a wirelessInternet, and a wireless communication network, or the like, but is notlimited thereto. The wireless communication network may include, but isnot limited to, Wi-Fi, Bluetooth, Bluetooth Low Energy, ZigBee, Wi-FiDirect (WFD), ultra wideband (UWB), infrared data association (IrDA),and Near Field Communication (NFC).

FIG. 2 is a flowchart illustrating an example method by which theelectronic device 1000 provides a notification message to a user,according to an embodiment.

In operation S210, the electronic device 1000 may obtain a videosequence including a plurality of frames from a camera installed on avehicle. According to an embodiment, the electronic device 1000 mayreceive a video sequence by communicating with a camera installed on avehicle via a certain network. For example, the video sequence may be ablack box image of the vehicle or an image received from a stereo cameraof the vehicle. According to an embodiment, the electronic device 1000may include a camera, and a video sequence may be obtained from thecamera included in the electronic device 1000.

A video sequence may include a series of still images. Each still imagemay be referred to as either a picture or a frame.

In operation S220, the electronic device 1000 may detect an objectincluded the plurality of frames from the plurality of frames includedin the video sequence. According to an embodiment, the electronic device1000 may detect one or more objects from one frame included in the videosequence. One or more objects detected from one frame may be detected inanother frame included in the same video sequence. One or more objectsdetected from one frame may not be detected in another frame included inthe same video sequence. For example, a road, a sidewalk, a firstvehicle, a second vehicle, a third vehicle, and a traffic sign may bedetected from a first frame of a video sequence, whereas only the road,the sidewalk, the first vehicle, and the third vehicle may be detectedfrom a second frame of the same video sequence and the second vehicleand the traffic sign may not be detected from the second frame.Furthermore, a motorcycle, which is not detected from the first frame,may be detected from the second frame.

According to an embodiment, the electronic device 1000 may determine atype of an object.

Types of an object may include, for example, and without limitation, aroad, a sidewalk, a building, a wall, a fence, a pole, a traffic light,a traffic sign, vegetation, a terrain, the sky, a person, a rider, acar, a truck, a bus, a learn, a motorcycle, a bicycle, or the like.

For example, the electronic device 1000 may detect a plurality ofobjects from one frame and determine respective types of the pluralityof objects. Furthermore, the electronic device 1000 may distinguishobjects of a same type even when some of the plurality of objects areobjects of a same type. For example, when three vehicles are detected inone frame, the electronic device 1000 may distinguish the three vehiclesfrom one another as a first vehicle, a second vehicle, and a thirdvehicle.

According to an embodiment, the electronic device 1000 may use a firstlearning model to detect objects included in a frame. When framesobtained from a video sequence are input to the first learning model,information regarding the objects detected from the frames may be outputfrom the first learning model. An operation for detecting an objectusing the first learning model will be described below with reference toFIG. 3.

In operation S230, the electronic device 1000 may obtain positionalinformation regarding objects for each of the plurality of framesincluded in the video sequence.

According to an embodiment, the electronic device 1000 may determinerespective positions of an object in the plurality of frames to obtainpositional information regarding the object. For example, the electronicdevice 1000 may determine a position of an object in one frame. Forexample, the electronic device 1000 may determine a position of theobject in another frame. Furthermore, for example, the electronic device1000 may determine positions of a plurality of objects in one frame. Forexample, the electronic device 1000 may determine positions of theplurality of objects in another frame. Therefore, the electronic device1000 may determine positions of the plurality of objects in each of theplurality of frames.

According to an embodiment, the electronic device 1000 may determinepositions of an object on the pixel-by-pixel basis. For example, theelectronic device 1000 may determine pixels indicating an object fromamong pixels of a frame. For example, when one frame includes aplurality of objects, the electronic device 1000 may determine pixelsrepresenting each of the plurality of objects. For example, theelectronic device 1000 may determine a detected object indicated by oneor more arbitrary pixels from among pixels of a frame.

The method by which the electronic device 1000 precisely obtainspositional information regarding an object on the pixel-by-pixel basisinstead of a bounding box may be applied to a technical field demandingaccurate cognitive functions. For example, the electronic device 1000may obtain positional information regarding an object on thepixel-by-pixel basis, thereby analyzing a change of the position of theobject in a time-series manner. Therefore, the electronic device 1000may be applied to an autonomous driving technique demanding fast andaccurate cognitive function.

According to an embodiment, the electronic device 1000 may use a firstlearning model to obtain positional information regarding an object.When a plurality of frames are input to the first learning model, pixelinformation may be output from the first learning model. The pixelinformation may be information regarding objects respectively indicatedby groups of pixels of a frame. Although the operation S220 and theoperation S230 have been described above as separate operations, thepresent disclosure is not limited thereto. For example, when a pluralityof frames are input to the first learning model, information regardingan object detected from the plurality of frames and pixel informationmay be output together. For example, only pixel information may beoutput. An operation for obtaining positional information regarding anobject using the first learning model will be described below withreference to FIG. 3.

In operation S240, the electronic device 1000 may determine whether anevent related to driving of the vehicle has occurred by analyzingtime-series changes of positions of the objects in the plurality offrames.

According to an embodiment, the electronic device 1000 may analyze aposition change of an object from a previous frame to a next frameaccording to a display order of a video sequence. For example, theelectronic device 1000 may compare positional information regarding anobject included in a first frame with positional information regardingthe same object included in a second frame, which is reproduced laterthan the first frame, thereby analyzing a position change of the object.For example, the electronic device 1000 may determine whether an eventhas occurred by analyzing respective position changes of a plurality ofobjects according to the lapse of time. Therefore, the electronic device1000 may more accurately determine whether an event has occurred basedon composite recognition of changes of position of a plurality ofobjects that are determined on the pixel-by-pixel basis instead oftracking bounding boxes regarding a region of interest (ROI). Forexample, when a first vehicle and a second vehicle are stopped forward,and a third vehicle, a fourth vehicle, and a fifth vehicle successivelychange lanes to the right, the electronic device 1000 may determine thatan event involving an accident between vehicles up ahead has occurred.Furthermore, since there are vehicles in an accident up ahead, an actionguideline indicating that it is desirable to change a lane to the rightmay be determined in correspondence to the event.

According to an embodiment, the electronic device 1000 may determine thetype of an event by analyzing the time-series changes of positions ofobjects in a plurality of frames.

Types of an event related to driving of a vehicle may include, but isnot limited to, a traffic signal change, a possible accident, a roadsituation change, and a terrain change, or the like. An example of atraffic signal changes may be that a traffic light changes from green tored or from red to green. Examples of a possible accident may includeinsufficient safety distances to a vehicle in front and/or a vehiclebehind, an appearance of an unexpected person, etc. An example of a roadsituation change may be that a road is blocked due to an accidentvehicle ahead. An example of a terrain change may include a winding roadahead, a hill ahead, or the like.

According to an embodiment, the electronic device 1000 may determine arisk level of driving by analyzing time-series changes of positions ofobjects in a plurality of frames.

For example, a risk level of driving may be indicated by a numericalvalue. The higher a numerical value, the higher a risk level of drivingmay be. For example, a risk level of driving may be indicated by aninteger between 1 and 100, and the electronic device 1000 may beconfigured to include a risk level of driving in a notification messagewhen the risk level of driving is equal to or greater than a criticalvalue. Furthermore, for example, a risk level of driving may beindicated as high, middle, and low.

According to an embodiment, the electronic device 1000 may use a secondlearning model to determine when an event has occurred. When pixelinformation output from the first learning model is input to the secondlearning model, information indicating whether an event has occurred maybe output. An operation for determining whether an event has occurredusing the second learning model will be described below with referenceto FIG. 3.

In operation S250, the electronic device 1000 may generate anotification message about an event based on the determination of anoccurrence of the event.

According to an embodiment, different notification messages may begenerated depending on factors including types of an object, time-serieschanges of the position of the object, and an occurrence of an event.

When it is determined in the operation S240 that an event has occurred,in the operation S250, electronic device 1000 may generate anotification message about the event. Furthermore, when it is determinedin the operation S240 that no event has occurred, in the operation S250,the electronic device 1000 may not generate a notification message aboutan event. According to an embodiment, when the electronic device 1000decides not to generate a notification message about an event, theelectronic device 1000 may generate no notification message or maygenerate a pre-set notification message including no notification of anevent. For example, the electronic device 1000 may generate anotification message that includes at least one of a currenttemperature, an rpm of a vehicle, a moving direction of the vehicle, atraffic situation, and a risk level of driving. According to anembodiment, a notification message that does not include an eventnotification may be set to a default value in the electronic device1000.

According to an embodiment, the electronic device 1000 may generate anotification message based on a type of an event and a risk level ofdriving. A method by which the electronic device 1000 determines contentof a notification message based on the type of an event and a risk levelof driving will be described below with reference to FIG. 5.

A notification message according to an embodiment may be generated as atext message or a voice message, or the like, but is not limitedthereto. Furthermore, for example, a notification message generated inthe form of a text message may be Text-to-Speech converted, therebyobtaining a voice notification message.

When pixel information output from the first learning model is input tothe second learning model, a notification message may be output.Although the operation S240 and the operation S250 are described aboveas separate operations, the present disclosure is not limited thereto.For example, when pixel information is input to the first learningmodel, information indicating whether an event has occurred and anotification message may be output together. Only a notification messagemay be output. An operation for generating a notification message usingthe second learning model will be described below with reference to FIG.3.

According to an embodiment, an operation for detecting an object, anoperation for obtaining positional information regarding the object, anoperation for determining whether an event has occurred, and anoperation for generating a notification message may be performed using aplurality of learning models.

In operation S260, the electronic device 1000 may output a generatednotification message.

According to an embodiment, a notification message may be output in theform of a sound, a text, an image, and/or a vibration, or the like, butis not limited thereto.

According to an embodiment, the electronic device 1000 may display anotification message on a head-up display (HUD) or a dashboard of avehicle, or the like, but is not limited thereto.

According to an embodiment, the electronic device 1000 may output anotification message through a speaker of a vehicle when thenotification message is in the form of a voice. For example, theelectronic device 1000 may transmit a command for controlling thespeaker of the vehicle to output a notification message in the form of avoice to the speaker of the vehicle.

According to an embodiment, based on a type of an event and a risk levelof driving, a command for controlling an operation of a module installedon a vehicle may be transmitted to the corresponding module. When theelectronic device 1000 determines that it is desirable to control anoperation of the module installed on the vehicle based on the type of anevent and a risk level of driving, the electronic device 1000 maytransmit a command for controlling an operation of the module installedon the vehicle to the module installed on the vehicle. For example, theelectronic device 1000 may output a notification message and control anoperation of the module installed on the vehicle simultaneously, basedon the type of an event and a risk level of driving. When a user inputfor controlling an operation of the module installed on the vehicle isnot received within a certain time period after a notification messageis output, the electronic device 1000 may transmit a command forcontrolling an operation of the module installed on the vehicle to themodule installed on the vehicle. The electronic device 1000 may transmita command for controlling an operation of the module installed on thevehicle to the module installed on the vehicle without outputting anotification message, based on the type of an event and a risk level ofdriving.

A user input according to an embodiment may include, but is not limitedto, at least one of a step-on input, a steering input, a voice input, akey input, a touch input, a bending input, and a multimodal input, orthe like. The step-on input may refer to an input applied as a usersteps on a brake to control a vehicle's brake. The steering input mayrefer to an input applied as a user rotates a steering wheel to controla vehicle's steering.

FIG. 3 is a flowchart illustrating an example method by which anelectronic device 1000 provides a notification message to a user using alearning model, according to an embodiment.

In operation S310, the electronic device 1000 may apply a filter to aplurality of frames to flatten lightening degrees of a plurality offrames for inputting a plurality of frames to a first learning model.

The first learning model may, for example, and without limitation, begenerated by learning criteria for determining types of an object andlearning criteria for determining positions of an object in a pluralityof frames using a fully convolutional network (FCN).

According to an embodiment, the electronic device 1000 may convert RGBchannels of a frame into a luminance-chromatic (Lab) channel. The Lvalue of the converted Lab channel is a luminance value of the image,which is a value indicating the brightness of an image excluding colorinformation. The electronic device 1000 may perform preprocessing toapply a median filter for flattening L values of a plurality of framesincluded in a video sequence to the plurality of frames before inputtinga plurality of frames to the first learning model. By performing apreprocessing, an object may be detected more easily even when it isdark or rainy, and a plurality of objects may be distinguished from oneanother.

In operation S320, the electronic device 1000 may determine the type ofan object included in the plurality of frames using the first learningmodel.

For example, the electronic device 1000 may detect a plurality ofobjects from one frame and determine the type of each of the pluralityof objects using the first learning model. For example, when one frameis input to the first learning model, different values may be outputdepending on types of objects included in the frame. For example, in thefirst learning model, the sky may be set to a value 12, a plant may beset to a value 10, a road may be set to a value 4, a sidewalk may be setto a value 3, a vehicle may be set to a value 6, and a person may be setto a value 8. For example, when an input frame includes a plant, a road,a vehicle, a vehicle, and a person, an output of the first learningmodel may include values 4, 6, 8, and 10. Therefore, objects included ina frame may be detected using the first learning model. Furthermore, forexample, when a frame is input to the first learning model, a valuecorresponding to the type of an object may not be output, and pixelinformation regarding respective objects indicated by groups of pixelsof the frame may be output. The pixel information may be a matrix inwhich values corresponding to the types of an object are matched topositions of the object in a frame.

In operation S330, the electronic device 1000 may determine respectivepositions of an object in the plurality of frames using the firstlearning model.

According to an embodiment, the electronic device 1000 may determine notonly the type of an object, but also the positions of the object in aplurality of frames using the first learning model. For example, theelectronic device 1000 may determine the position of an object on thepixel-by-pixel basis using the first learning model. For example, sincean electronic device 1000 may determine certain pixels of the frameindicating respective objects, when a frame is input to the firstlearning model, a matrix in which values corresponding to the types ofan object are matched to positions of the object in the frame may beoutput. Since positions of an object are determined on thepixel-by-pixel basis, when the size of the frame is, for example,512×256, the size of a matrix may also be 512×256. In other words, as anoutput of the first learning model corresponding to input of a frame, amatrix including information regarding the type of an object andpositional information regarding the object may be obtained.

In operation S340, the electronic device 1000 may reduce the dimensionof an output of the first learning model to input the same to a secondlearning model.

The second learning model may be generated by learning criteria fordetermining whether an event related to driving of a vehicle hasoccurred and criteria for determining the content of a notificationmessage by analyzing time-series changes of positions of an object in aplurality of frames using, for example, and without limitation, arecurrent neural network (RNN).

According to an embodiment, an output of the first learning model may beused as an input to the second learning model. According to anotherembodiment, the electronic device 1000 may user a matrix obtained byreducing the dimension of a matrix output from the first learning modelas an input to the second learning model for reducing amounts ofcalculations of the second learning model for determining whether anevent has occurred and generating a notification message. For example,and without limitation, to reduce the dimension of a matrix, a dilatedconvolution may be used.

Furthermore, according to an embodiment, to reduce amounts ofcalculations of the first learning model, the electronic device 1000may, for example, perform 1×1 convolutional filtering on outputs oflayers included in the first learning model, thereby matching dimensionsbetween the layers included in the first learning model.

In operation S350, the electronic device 1000 may use the secondlearning model to determine when an event related to driving of avehicle has occurred.

According to an embodiment, when a first vehicle and a second vehicleare stopped forward and a third vehicle, a fourth vehicle, and a fifthvehicle successively change lanes to the right, the electronic device1000 may determine that an event corresponding to accident vehiclesahead has occurred. Furthermore, since there are accident vehiclesahead, an action guideline indicating that it is desirable to change alane to the right may be determined in correspondence to the event.

As described above, since the electronic device 1000 may obtaininformation regarding types objects and positional information regardingthe objects in an entire screen image with small calculation amountsusing the first learning model, time-series changes of positions of theobjects may be quickly and accurately analyzed without setting a regionof interest (ROI) unlike as a method of tracking an object.

Therefore, the electronic device 1000 may use the second learning modelto determine an occurrence of an event that may be detected by analyzingtime-series positional changes of an object in addition to eventsassociated with driving of a vehicle. For example, the electronic device1000 may generate subtitles in real time during reproduction of a movieusing the second learning model.

In operation S360, the electronic device 1000 may generate anotification message using the second learning model.

According to an embodiment, when an output of the first learning modelis processed to reduce calculation amounts and input to the secondlearning model, a notification message may be output. A controloperation corresponding to an event may be output or a control operationcorresponding to an event may be output together with a notificationmessage.

According to an embodiment, a notification message generated using thesecond learning model may differ depending on types of an event andrisks of driving. For example, depending on types of an event and risksof driving, contents included in a notification message may vary. Forexample, contents included in a notification message may include anotification of an event, an action guideline corresponding to theevent, an alarm sound, or the like. For example, when there are accidentvehicles ahead, the electronic device 1000 may include a notification ofan event and an action guideline corresponding to the event in anotification message, thereby generating a notification message “Thereare accident vehicles ahead, so please change a lane to the right.”

Contents included in a notification message will be described in greaterdetail below with reference to FIG. 5.

FIG. 4 is a diagram illustrating an example of generating a notificationmessage by using a learning model, according to an embodiment.

A convolutional neural network (CNN), for example, includes a fullyconnected layer in a rear layer to classify class of image data. When aninput image passes through the fully connected layer, positionalinformation regarding objects included in the input image disappears. Toresolve the problem, a fully convolutional network (FCN) considers thefully connected layer of a CNN as 1×1 convolution, thereby retainingpositional information of objects included in an input image.

According to an embodiment, the electronic device 1000 may input a videosequence 410 preprocessed for leveling the brightness to a firstlearning model. Since the first learning model uses a FCN, when thevideo sequence 410 is input to the first learning model, a series ofmatrices including information regarding types of objects and positionalinformation regarding the objects may be output. According to anembodiment, the video sequence 410, which has been preprocessed to beinput to the first learning model, may be input to the first learningmodel in the order of their reproduction. An output in which matricesare output from the first learning model may be the identical to theorder in which the video sequence 410 is input to the first learningmodel.

When a series of matrices output from the first learning model areimaged, a video sequence 420 indicated by different colors according totypes of objects included in the video sequence 410 may be obtained.When dilated convolution is performed in a video sequence 430 obtainedby dividing the video sequence 420 into pixels, a matrix 440, which hasa dimension reduced from that of a matrix output from the first learningmodel, may be obtained. The dilated convolution is a method ofperforming convolution using only some of pixels included in the videosequence 430. For example, by skipping one or more pixel and performingconvolution, the size of a matrix and calculation amounts may be reducedby extending the size of a receptive field (RF).

According to an embodiment, when the matrix 440 is input to the secondlearning model, a notification message 460 may be output. The secondlearning model is based, for example, on a recurrent neural network(RNN) where a neural network with recurrent connections between nodes indifferent time sections is referred to as the RNN. A RNN according to anembodiment may recognize sequential data. The sequential data is datawith temporality or a sequence, such as voice data, image data,biometric data, and handwriting data. For example, the recognition modelof the RNN may recognize a pattern according to which input image datachanges.

The RNN may be learned through supervised learning in which learningdata and output data corresponding thereto are input to a neural networktogether and connection weights of connection neurons are updated so asto output output data corresponding to the learning data. For example,the RNN may update connection weights between neurons based on deltarules and backpropagation learning.

The RNN may be a structure including a long short-term memory (LSTM)network 450. The LSTM network 450 is a type of RNN supporting long-termdependency learning. In an RNN that does not include the LSTM network450, information regarding a previous task may be connected to a currenttask, but it is difficult to connect information regarding a previoustask corresponding to a time point far from a current time point to acurrent task. The LSTM network 450 may be a structure designed to avoidsuch a long-term dependency problem. Since the LSTM network 450 mayextract a relative amount of change that varies according to the lapseof time from input data as a feature value, it may be determined whetheran event has occurred by analyzing a time-series change of positions ofan object.

Since the second learning model uses the RNN including the LSTM network450, structures for all of a previous time step, a current time step,and a next time step may be used for learning, and information regardinga current stage may be forwarded to a next stage and influences anoutput value.

According to an embodiment, the matrices 440 obtained by reducing thedimension of an output of the first learning model may be input to thesecond learning model in the order that the matrix 440 is output fromthe first learning model. The second learning model may generate anotification message by taking an occurrence of an event, the type ofthe event and a risk level of driving into account.

For convenience of explanation, the first learning model and the secondlearning model have been described separately. However, the firstlearning model and the second learning model may exist as a plurality oflearning models or a single integrated learning model according to theirfunctions and roles.

FIG. 5 is a table illustrating how contents of a notification messageare determined, according to an embodiment.

Referring to FIG. 5, a table 500 indicates how contents of anotification message are determined according to the type of an eventand a risk level of driving. The table 500 according to an embodiment ismerely an example, and a plurality of learning models may becontinuously updated. Therefore, output values according to input valuesto the plurality of learning models may be continuously updated. Theelectronic device 1000 may use a second learning model to outputdifferent notification messages depending on types of an event and risksof driving. For example, as shown in FIG. 5, when the type of an eventis a possible accident due to an insufficient safety distance to avehicle in front and a risk level of driving is high, the electronicdevice 1000 may generate a notification message including an alarm soundand an action guideline “please step on the brake immediately”corresponding to the event. Furthermore, according to an embodiment, theelectronic device 1000 may determine, based on the type of an event anda risk level of driving, a time period during which a user input forperforming an action guideline may be received. For example, a timeperiod during which a user input may be received may be determined basedon a risk level of driving. Furthermore, for example, data fordetermining a time period during which a user input may be received maybe set and changed based on a learning according to pre-set criteria.For example, when it is determined that a risk level of driving is highand a user input for controlling an operation of a module installed on avehicle is not received within a certain time period, the electronicdevice 1000 may transmit a command for controlling an operation of themodule installed on the vehicle to the module installed on the vehicle.

For example, when the type of an event is change of a terrain to awinding path ahead and a risk level of driving is medium, the electronicdevice 1000 may generate a notification message “there is a winding roadahead, so please be aware” including a notification of the event and anaction guideline corresponding to the event.

For example, when the type of an event is a change of a road situationdue to an accident vehicle ahead and a risk level of driving is low, theelectronic device 1000 may generate a notification message “there is anaccident vehicle ahead, so please change a lane to the right” includinga notification of the event and an action guideline corresponding to theevent.

For example, when the type of an event is a change of a traffic lightfrom green to red and a risk level of driving is high, the electronicdevice 1000 may generate a notification message “a traffic light ischanged, so please stop” including a notification of the event and anaction guideline corresponding to the event.

For example, when the type of an event is a change of a traffic lightfrom red to green and a risk level of driving is low, the electronicdevice 1000 may generate a notification message “a traffic light ischanged, so please drive” including a notification of the event and anaction guideline corresponding to the event.

FIG. 6 is a diagram illustrating an example of outputting a notificationmessage according to an embodiment.

According to an embodiment, the electronic device 1000 may display anotification message on, for example, a head up display (HUD) of avehicle.

For example, when the electronic device 1000 determines that an event inwhich an accident is anticipated due to an insufficient safety distanceto a vehicle in front 610 has occurred and a risk level of driving 630is 35, the electronic device 1000 may display a notification messageincluding the risk level of driving 630 and a virtual image 620 forestablishing a safety distance on a HUD of a vehicle. Furthermore, theelectronic device 1000 may output a notification message including analarm sound and an action guideline “please step on a brake immediately”corresponding to the event, in the form of voice. Furthermore, forexample, when a step-on input for stepping on a brake is not receivedfrom a user within a certain time period after a notification message isoutput, the electronic device 1000 may transmit a command forcontrolling an operation of the brake to the brake. For example, thecertain time period may be set based on learning and may vary dependingon the risk level of driving 630. For example, the higher the risk levelof driving 630 is, the smaller the certain time period may be set. Forexample, when a distance between the vehicle in front 610 and a user'svehicle is too small and an accident is expected without immediatelystepping on a brake, a command for controlling the operation of thebrake may be transmitted to the brake simultaneously as a notificationmessage is output.

FIG. 7 is a diagram illustrating an example of outputting a notificationmessage according to an embodiment.

According to an embodiment, the electronic device 1000 may transmit acommand for controlling an operation of a module installed on a vehicleto the module installed on the vehicle.

For example, when the electronic device 1000 determines that an eventrelated to a road situation change making it impossible to drive along acertain lane due to vehicles being in an accident up ahead has occurred,the electronic device 1000 may display a pre-set notification messagewithout a notification of the event on a HUD of a vehicle. For example,the electronic device 1000 may display a notification message thatincludes at least one of a current temperature, an rpm of a vehicle, amoving direction of the vehicle, a traffic situation, and a risk levelof driving on the HUD. Furthermore, the electronic device 1000 mayoutput a notification message “there are accident vehicles ahead, soplease change a lane to the right” in the form of a voice. Furthermore,when a steering input for rotating a steering wheel 710 is not receivedfrom a user within a certain time period after a notification message isoutput in the form of a voice, for example, a command for rotating thesteering wheel 710 may be transmitted to the steering wheel 710.Therefore, the electronic device 1000 may autonomously adjust a drivingpath by guiding a user to adjust the steering wheel 710 or transmittinga command for adjusting the steering wheel 710 to the steering wheel710.

FIGS. 8 and 9 are block diagrams illustrating an example of theelectronic device 1000 according to some embodiments.

As illustrated in FIG. 8, the electronic device 1000 according to anembodiment may include a processor (e.g., including processingcircuitry) 1300, a communicator (e.g., including communicationcircuitry) 1500, and a memory 1700. However, not all of the componentsillustrated in FIG. 8 are necessary components of the electronic device1000. The electronic device 1000 may be implemented by more componentsthan the components illustrated in FIG. 8. The electronic device 1000may also be implemented by fewer components than the componentsillustrated in FIG. 8.

For example, as illustrated in FIG. 9, the electronic device 1000according to an embodiment may further include an input unit (e.g.,including input circuitry) 1100, an output unit (e.g., including outputcircuitry) 1200, a sensing unit (e.g., including sensing circuitry)1400, and an audio/video (A/V) input unit (e.g., including A/V inputcircuitry) 1600 in addition to the processor 1300, the communicator(communication unit) 1500, and the memory 1700.

For example, the electronic device 1000 according to an embodiment maybe a vehicle dashboard including the processor 1300, the communicator1500, and the memory 1700. The electronic device 1000 according to anembodiment may be a vehicle including at least one of the input unit1100, the output unit 1200, the sensing unit 1400, and the A/V inputunit 1600 in addition to the processor 1300, the communicator(communication unit) 1500, and the memory 1700.

The input unit 1100 may refer, for example, to a unit including variousinput circuitry with which a user inputs data for controlling theelectronic device 1000. For example, the input unit 1100 may includevarious input circuitry, such as, for example, and without limitation, akey pad, a dome switch, a touch pad (a contact capacitance type, apressure resistive type, an infrared ray detecting type, a surfaceacoustic wave propagating type, an integral tension measuring type, apiezo-effect type, etc.), a jog wheel, and a jog switch, or the like,but is not limited thereto.

The input unit 1100 may receive a user input for controlling anoperation of a module installed on a vehicle.

The output unit 1200 may include various output circuitry that mayoutput an audio signal, a video signal, or a vibration signal, or thelike, and the output unit 1200 may include, for example, and withoutlimitation, a display 1210, an acoustic (e.g., sound) output unit 1220,and a vibration motor 1230, or the like. According to an embodiment, theoutput unit 1200 may output a notification message in the form of anaudio, a video, and/or vibration.

The display unit 1210 may include, for example, a display that displaysinformation processed by the electronic device 1000. For example, thedisplay 1210 may display a notification message in a head-up display(HUD) of a vehicle.

The acoustic (sound) output unit 1220 may include various circuitry thatoutputs audio data received from the communicator (communication unit)1500 or stored in the memory 1700. The acoustic output unit 1220 alsooutputs acoustic signals associated with functions performed by theelectronic device 1000 (e.g., a call signal reception tone, a messagereception tones, a notification tone, etc.). For example, the acousticoutput unit 1220 may output an alarm sound to notify that an event hasoccurred.

The processor 1300 may include various processing circuitry andtypically controls the overall operation of the electronic device 1000.For example, the processor 1300 may control the overall operations ofthe input unit 1100, the output unit 1200, the sensing unit 1400, thecommunication unit 1500, and the A/V input unit by executing programsstored in the memory 1700. Furthermore, the processor 1300 may performthe functions of the electronic device 1000 described above with respectto FIGS. 1 through 13 by executing programs stored in the memory 1700.The processor 1300 may include various processing circuitry, such as,for example, and without limitation, at least one processor. Theprocessor 1300 may include a plurality of processors or may include oneintegrated processor, depending on functions and roles of the processor1300. According to an embodiment, the processor 1300 may include atleast one processor for providing a notification message by executing atleast one program stored in the memory 1700.

According to an embodiment, the processor 1300 may obtain a videosequence including a plurality of frames from a camera installed on avehicle through the communicator 1500. According to an embodiment, theprocessor 1300 may transmit a command for controlling an operation of amodule installed on the vehicle to the module installed on the vehiclethrough the communication unit 1500, based on the type of an event and arisk level of driving.

According to an embodiment, the processor 1300 may detect an objectincluded in a plurality of frames. According to an embodiment, processor1300 may obtain positional information regarding an object for each ofthe plurality of frames. The processor 1300 may determine positions ofthe object on the pixel-by-pixel basis. According to an embodiment, theprocessor 1300 may determine whether an event related to driving of avehicle has occurred, by analyzing a time-series change of positions ofthe object in the plurality of frames. According to an embodiment, theprocessor 1300 may determine the type of the event and a risk level ofdriving by analyzing the time-series change of positions the object inthe plurality of frames. According to an embodiment, processor 1300 maygenerate a notification message that notifies the event based on adetermination of whether an event has occurred. According to anembodiment, the processor 1300 may generate a notification message thatnotifies the event based on the type of the event and a risk level ofdriving. According to an embodiment, the processor 1300 may control tooutput the generated notification message through the output unit 1200.According to an embodiment, the processor 1300 may control to displaythe generated notification message via the display unit 1210. Accordingto an embodiment, the processor 1300 may perform detection of an object,obtaining of positional information regarding the object, determinationof whether an event has occurred, and generation of a notificationmessage using different learning models.

According to an embodiment, a first learning model may be generated bylearning criteria for determining types of an object and criteria fordetermining positions of an object in a plurality of frames using, forexample, a fully convolutional network (FCN). The processor 1300 maydetermine the type of an object and determine the positions of theobject in the plurality of frames, by using the first learning model.

According to an embodiment, a second learning model may be generated bylearning criteria for determining whether an event related to driving ofa vehicle has occurred and criteria for determining the content of anotification message by analyzing time-series changes of positions of anobject in a plurality of frames using, for example, a recurrent neuralnetwork (RNN). The processor 1300 may determine whether an event relatedto driving of a vehicle has occurred and determine the content of anotification message, by using the second learning model.

According to an embodiment, the processor 1300 may apply a filter to aplurality of frames to flatten lightening degrees of the plurality offrames to input the plurality of frames to the first learning model andreduce the dimension of an output of the first learning model to inputthe output to the second learning model.

The sensing unit 1400 may include various sensors (sensing circuitry) tosense a state of the electronic device 1000, a state of a user, or astate around the electronic device 1000 and transmit sensed informationto the processor 1300.

The sensing unit 1400 may, for example, include, but is not limited to,at least one of a magnetic sensor 1410, an acceleration sensor 1420, atemperature/humidity sensor 1430, an infrared sensor 1440, a gyroscopesensor 1450, a position sensor (e.g., GPS) 1460, an atmospheric sensor1470, a proximity sensor 1480, and an RGB sensor 1490. The function ofeach sensor may be intuitively deduced from the name thereof by one ofordinary skill in the art, and thus detailed description thereof willnot be provided here.

The communication unit 1500 may include various communication circuitryincluding one or more components that enable the electronic device 1000to communicate with another electronic device (not shown) and the server2000. The other electronic device (not shown) may be, but is not limitedto, a computing device or a sensing device. Furthermore, for example,the other electronic device may be a module included in a vehicle likethe electronic device 1000. For example, the communication unit 1500 mayinclude a short-range communication unit 1510, a mobile communicationunit 1520, and a broadcast reception unit 1530.

The short-range communication unit 1510 may, for example, be, but is notlimited to, a Bluetooth communicator, a Bluetooth low energy (BLE)communicator, a near field communicator/RF identification (RFID)communicator, a WLAN (Wi-Fi) communicator (not shown), a Zigbeecommunicator, an infrared data association (IrDA) communicator, a Wi-Fidirect (WFD) communicator, an ultra wideband (UWB) communicator, and anAnt+ communicator.

The mobile communication unit 1520 may include various mobilecommunication circuitry that transmits and receives a wireless signal toand from at least one of a base station, an external terminal, and aserver on a mobile communication network. Here, the wireless signal mayinclude various types of data associated with transmission/reception ofa voice call signal, a video call signal, or a text/multimedia message.

The broadcast reception unit 1530 may include various broadcastreceiving circuitry that receives a broadcast signal and/orbroadcast-related information from the outside through a broadcastchannel. The broadcast channel may include a satellite channel and aterrestrial channel. The electronic device 1000 may not include thebroadcast reception unit 1530 according to some embodiments.

According to an embodiment, the communication unit 1500 may receive avideo sequence including a plurality of frames from a camera installedon a vehicle. According to an embodiment, the communication unit 1500may transmit a command for controlling an operation of the moduleinstalled on the vehicle to the module installed on the vehicle.

The A/V input unit 1600 is a unit for inputting an audio signal or avideo signal and may include various A/V input circuitry, such as, forexample, and without limitation, a camera 1610 and a microphone 1620.The camera 1610 may obtain an image frame, such as a still image or amoving picture, through an image sensor in a video call mode or aphotographing mode. An image captured through the image sensor may beprocessed by the processor 1300 or a separate image processor (notshown). For example, an image captured by the camera 1610 may beutilized as information for determining whether an event has occurred.

The microphone 1620 may receive an external acoustic signal andprocesses the external acoustic signal into electrical voice data. Forexample, microphone 1620 may receive an acoustic signal from an externalelectronic device or a user. The microphone 1620 may use various noisereduction algorithms to remove noises generated while an externalacoustic signal is being input.

The memory 1700 may store a program for data processing and controllingof the processor 1300 and may store data input to or output from theelectronic device 1000.

The memory 1700 may include at least one of be a flash memory, a harddisk, a multimedia card micro, a card type memory (e.g., an SD memory oran XD memory), a random access memory (RAM), a static random accessmemory (SRAM), a read-only memory (ROM), an electrically erasableprogrammable read-only memory (EEPROM), a programmable read-only memory(PROM), a magnetic memory, a magnetic disk, and an optical disc, or thelike, but is not limited thereto.

Programs stored in the memory 1700 may be classified into a plurality ofmodules according to their functions. For example, programs stored inthe memory 1700 may be classified into a UI module 1710, a touch screenmodule 1720, a notifying module 1730,

The UI module 1710 may provide a dedicated UI or a dedicated GUIassociated with the electronic device 1000 for each application. Thetouch screen module 1720 may sense a touch gesture on a user's touchscreen and transmit information regarding the touch gesture to theprocessor 1300. The touch screen module 1720 according to an embodimentmay recognize and analyze a touch code. The touch screen module 1720 mayalso be configured as separate hardware including a processor.

The notifying module 1730 may generate a signal for notifying anoccurrence of an event. The notifying module 1730 may output anotification signal in the form of a video signal through the display1210, may output a notification signal in the form of an audio signalthrough the acoustic output unit 1220, or may output a notificationsignal in the form of a vibration signal via the vibration motor 1230.

FIG. 10 is a block diagram illustrating the processor 1300 according toan embodiment.

Referring to FIG. 10, the processor 1300 according to an embodiment mayinclude a data learner (e.g., including processing circuitry and/orprogram elements) 1310 and a data recognizer (e.g., including processingcircuitry and/or program elements) 1320.

The data learner 1310 may include various processing circuitry and/orprogram elements that learn criteria for obtaining pixel information andgenerating a notification message. The data learner 1310 may learncriteria regarding which data to use for obtaining pixel data andgenerating a notification message and also may learn criteria regardinghow to obtain pixel information and generate a notification message byusing data. The data learner 1310 may obtain data to be used forlearning and applies the obtained data to a data recognition model to bedescribed below, thereby learning criteria for obtaining pixelinformation and generating a notification message.

Although it has been described above with respect to FIGS. 1 through 9that an operation for detecting an object, an operation for obtainingpositional information regarding the object, an operation fordetermining the type of the object, an operation for determining aposition of the object, an operation for determining whether an eventhas occurred, an operation for determining the type of the event, anoperation for determining a risk level of driving, an operation forgenerating a notification message, and an operation for determining thecontent of the notification message as independent operations, thepresent disclosure is not limited thereto. At least two of theoperations for detecting an object, the operation for obtainingpositional information regarding the object, the operation fordetermining the type of the object, the operation for determining aposition of the object, the operation for determining whether an eventhas occurred, the operation for determining the type of the event, theoperation for determining a risk level of driving, the operation forgenerating a notification message, and the operation for determining thecontent of the notification message may be performed based on a learningaccording to pre-set criteria.

The data recognizer 1320 may include various processing circuitry and/orprogram elements that obtain pixel information and generate anotification message, based on data. The data recognizer 1320 mayrecognize pixel information and a notification messages from certaindata using a learned data recognition model. The data recognizer 1320may obtain certain data according to pre-set criteria based on learningand utilizes a data recognition model using the obtained data as inputvalues thereto, thereby determining how to obtain pixel information andhow to generate a notification message, based on certain data.Furthermore, a result value output by a data recognition model usingobtained data as an input value may be used to update the datarecognition model.

At least one of the data reader 1310 and the data recognizer 1320 may,for example, and without limitation, be fabricated in the form of atleast one hardware chip and mounted on an electronic device. Forexample, at least one of the data learner 1310 and the data recognizer1320 may be fabricated in the form of a dedicated hardware chip forartificial intelligence (AI) or may be fabricated as a portion of ageneral-purpose processor (e.g., a CPU or an application processor) or agraphics-only processor (e.g., a GPU) and may be mounted on the variouselectronic devices as described above.

In this case, the data learner 1310 and the data recognizer 1320 may bemounted on one electronic device or separate electronic devices. Forexample, one of the data learner 1310 and the data recognizer 1320 maybe included in an electronic device, and the other one may be includedin a server. Furthermore, the data learner 1310 and the data recognizer1320 may be connected to each other via a wire or wirelessly, and thusthe data learner 1310 may provide model information constructed by thedata learner 1310 to the data recognizer 1320 and data input to the datarecognizer 1320 may be provided to the data learner 1310 as additionallearning data.

Meanwhile, at least one of the data reader 1310 and the data recognizer1320 may be implemented as a software module including program elements.When at least one of the data learner 1310 and the data recognizer 1320is implemented as a software module (or a program module includinginstructions), the software module may be stored in a non-transitorycomputer readable medium. Furthermore, in this case, at least onesoftware module may be provided by an operating system (OS) or providedby a certain application. Some of at least one of the software modulesmay be provided by an OS, and the remaining of the at least one ofsoftware modules may be provided by a certain application.

FIG. 11 is a block diagram illustrating the data learner 1310 accordingto an embodiment.

Referring to FIG. 11, the data learner 1310 according to someembodiments may include, for example, a data obtainer (e.g., includingprocessing circuitry and/or program elements) 1310-1, a pre-processor(e.g., including processing circuitry and/or program elements) 1310-2, alearning data selector (e.g., including processing circuitry and/orprogram elements) 1310-3, a model learner (e.g., including processingcircuitry and/or program elements) 1310-4, and a module evaluator (e.g.,including processing circuitry and/or program elements) 1310-5.

The data obtainer 1310-1 may obtain data necessary for determining howto obtain pixel information and how to generate a notification message.The data obtainer 1310-1 may obtain necessary data for learning todetermine of how to obtain pixel information and how to generate anotification message.

For example, the data obtainer 1310-1 may obtain voice data, image data,text data, or biometric signal data. For example, the data obtainer1310-1 may receive data via an input device (e.g., a microphone, acamera, a sensor, etc.) of the electronic device 1000. The data obtainer1310-1 may obtain data via another electronic device communicating withthe electronic device 1000. Alternatively, the data obtainer 1310-1 mayobtain data via a server communicating with the electronic device 1000.

For example, the data obtainer 1310-1 may receive a video sequence froma camera installed on a vehicle. Furthermore, for example, the dataobtainer 1310-1 may receive a video sequence from a camera capable ofphotographing the periphery of the vehicle. Furthermore, for example,the data obtainer 1310-1 may obtain a video sequence from a cameraprovided in the electronic device 1000.

The pre-processor 1310-2 may preprocess obtained data, such that theobtained data may be used for learning to determine how to obtain pixelinformation and how to generate a notification message. Thepre-processor 1310-2 may process the obtained data to a pre-set format,such that the obtained data may be used for learning to determine how toobtain pixel information and how to generate a notification message. Forexample, the pre-processor 1310-2 may perform pre-processing to apply afilter for flattening lightening degrees of a plurality of framesincluded in a video sequence to the plurality of frames.

The learning data selector 1310-3 may select data necessary for learningfrom preprocessed data. The selected data may be provided to the modellearner 1310-4. The learning data selector 1310-3 may select datanecessary for learning from the preprocessed data according to certaincriteria for determining how to obtain pixel information and how togenerate a notification message. Furthermore, the learning data selector1310-3 may select data according to certain criteria that is pre-setbased on learning by the model learner 1310-4, which will be describedbelow.

The model learner 1310-4 may learn criteria regarding how to obtainpixel information and how to generate a notification message based onlearning data. Furthermore, the model learner 1310-4 may learn criteriafor selecting learning data to use for determining how to obtain pixelinformation and how to generate a notification message.

Furthermore, the model learner 1310-4 may learn a data recognition modelused for determining how to obtain the pixel information and how togenerate a notification message based on learning data, by using thelearning data. In this case, the data recognition model may be a modelconstructed in advance. For example, the data recognition model may be amodel constructed in advance based on basic learning data (e.g., a blackbox image of a vehicle).

A data recognition model may, for example, and without limitation, beconstructed by taking a field of application of the data recognitionmodel, a purpose of learning, or computing performance of a device intoaccount. The data recognition model may be, for example, a model basedon a neural network. For example, a model, such as, for example, andwithout limitation, a deep neural network (DNN), a recurrent neuralnetwork (RNN), a fully convolutional network (FCN), and/or abidirectional recurrent deep neural network (BRDNN), or the like, may beused as the data recognition model, but the present disclosure is notlimited thereto.

According to various embodiments, when there are a plurality ofpre-constructed data recognition models, the model learner 1310-4 maydetermine a data recognition model of which input learning data ishighly related to basic learning data as a data recognition model tolearn. In this case, the basic learning data may be pre-classifiedaccording to data types, and data recognition models may bepre-constructed according to data types. For example, the basic learningdata may be pre-classified according to various criteria, such as areaswhere the basic learning data is generated, time points at which thebasic learning data is generated, sizes of the basic learning data,genres of the basic learning data, and creators of the basic learningdata,

The model learner 1310-4 may also learn a data recognition model using,for example, a learning algorithm including an error back-propagation ora gradient descent.

The model learner 1310-4 may learn a data recognition model through asupervised learning using, for example, learning data as an input value.Furthermore, the model learner 1310-4 may learn a data recognition modelthrough unsupervised learning for determining criteria for determininghow to obtain pixel information and how to generate a notificationmessage by learning data types necessary for determining how to obtainpixel information and how to generate a notification message based onlearning data. Furthermore, the model learner 1310-4 may learn a datarecognition model through reinforced learning using feedback regardingwhether a result of determining how to obtain pixel information and howto generate a notification message based on learning data is correct.

Furthermore, when the data recognition model is learned, the modellearner 1310-4 may store the learned data recognition model. In thiscase, the model learner 1310-4 may store the learned data recognitionmodel in a memory of the electronic device 1000 including the datarecognizer 1320. The model learner 1310-4 may store the learned datarecognition model in a memory of a server connected to the electronicdevice 1000 via a wired network or a wireless network.

In this case, the memory in which the learned data recognition model isstored may also store, for example, commands or data related to at leastone other component of the electronic device 1000. Furthermore, thememory may store software and/or programs. The programs may include, forexample, and without limitation, a kernel, a middleware, an applicationprogramming interface (API), and/or an application program (or“application”), or the like.

The module evaluator 1310-5 inputs evaluation data to a data recognitionmodel and, when a recognition result based on the evaluation data doesnot satisfy certain criteria, the module evaluator 1310-5 may make themodel learner 1310-4 to re-learn the data recognition model. In thiscase, the evaluation data may be pre-set data for evaluating the datarecognition model.

For example, when the number or a ratio of evaluation data correspondingto inaccurate recognition results from among recognition results of thelearned data recognition model with respect to evaluation data exceeds apreset critical value, the module evaluator 1310 may evaluate thatcertain criterion is not met. For example, when the certain criterion isdefined as a ratio of 2% and the learned data recognition model outputsincorrect recognition results for evaluation data exceeding 20 out of atotal of 1000 evaluation data, the module evaluator 1310-5 may evaluatethat the learned data recognition model is inappropriate.

On the other hand, when there are a plurality of learned datarecognition models, the module evaluator 1310-5 may evaluate whethereach of the learned data recognition models satisfies a certaincriterion and determine a learned data recognition model that satisfiesthe certain criterion as a final data recognition model. In this case,when there are a plurality of models satisfying the certain criterion,the module evaluator 1310-5 may determine any one or a certain number ofmodels preset in the descending order of evaluation scores as final datarecognition model(s).

Meanwhile, at least one of the data obtainer 1310-1, the pre-processor1310-2, the learning data selector 1310-3, the model learner 1310-4, andthe module evaluator 1310-4 in the data learner 1310 may be fabricatedin the form of at least one hardware chip and mounted on an electronicdevice. For example, at least one of the data obtainer 1310-1, thepre-processor 1310-2, the learning data selector 1310-3, the modellearner 1310-4, and the module evaluator 1310-5 may be fabricated as adedicated hardware chip for artificial intelligence (AI) or as a portionof a general-purpose processor (e.g., a CPU or an application processor)or a graphics dedicated processor (e.g., a GPU) and mounted on thevarious electronic devices as described above.

Furthermore, the data obtainer 1310-1, the pre-processor 1310-2, thelearning data selector 1310-3, the model learner 1310-4, and the moduleevaluator 1310-5 may be mounted on a single electronic device or may berespectively mounted on separate electronic devices. For example, someof the data obtainer 1310-1, the pre-processor 1310-2, the learning dataselector 1310-3, the model learner 1310-4, and the module evaluator1310-5 may be included in an electronic device, and the rest may beincluded in a server.

At least one of the data obtainer 1310-1, the pre-processor 1310-2, thelearning data selector 1310-3, the model learner 1310-4, and the moduleevaluator 1310-5 may be implemented as a software module. When at leastone of the data obtainer 1310-1, the pre-processor 1310-2, the learningdata selector 1310-3, the model learner 1310-4, and the module evaluator1310-5 is implemented as a software module (or a program moduleincluding instructions), the software module may be stored in anon-transitory computer readable medium. Furthermore, in this case, theat least one software module may be provided by an OS or provided by acertain application. Alternatively, some of the at least one softwaremodules may be provided by an OS, and the rest of the at least onesoftware modules may be provided by a certain application.

FIG. 12 is a block diagram illustrating the data recognizer 1320according to an embodiment.

Referring to FIG. 12, the data recognizer 1320 according to anembodiment may include a data obtainer (e.g., including processingcircuitry and/or program elements) 1320-1, a pre-processor (e.g.,including processing circuitry and/or program elements) 1320-2, arecognizing data selector (e.g., including processing circuitry and/orprogram elements) 1320-3, a recognition result provider (e.g., includingprocessing circuitry and/or program elements) 1320-4, and a modelupdater (e.g., including processing circuitry and/or program elements)1320-5.

The data obtainer 1320-1 may obtain data necessary for determining howto obtain pixel information and how to generate a notification messagebased on learning data, and the pre-processor 1320-2 may pre-processobtained data, such that the obtained data may be used to determine howto obtain pixel information and how to generate a notification message.The pre-processor 1320-2 may process the obtained data to a pre-setformat, such that the obtained data may be used by the recognitionresult provider 1320-4 to determine how to obtain pixel information andhow to generate a notification message.

The recognizing data selector 1320-3 may select data necessary fordetermining how to obtain pixel information and how to generate anotification message from the preprocessed data. The selected data maybe provided to the recognition result provider 1320-4. The recognizingdata selector 1320-3 may select some or all of the preprocessed dataaccording to pre-set criteria regarding how to obtain pixel informationand certain criteria for how to generate a notification message.Furthermore, the recognizing data selector 1320-3 may select dataaccording to pre-set criteria set based on a learning by the modellearner 1310-4, which will be described below.

The recognition result provider 1320-4 may apply the selected data to adata recognition model and determine how to obtain pixel information andhow to generate a notification message. The recognition result provider1320-4 may provide recognition results according to purposes of datarecognition. The recognition result provider 1320-4 may use dataselected by the recognizing data selector 1320-3 as input values,thereby applying the selected data to a data recognition model.Furthermore, recognition results may be determined by the datarecognition model.

The model updater 1320-5 may update a data recognition model based onevaluation of recognition results provided by the recognition resultprovider 1320-4. For example, the model updater 1320-5 may providerecognition results provided by the recognition result provider 1320-4to the model reader 1310-4, so that the model learner 1310-4 updates thedata recognition model.

Meanwhile, the data obtainer 1320-1, the pre-processor 1320-2, therecognizing data selector 1320-3, the recognition result provider1320-4, and the model updater 1320-4 in the data recognizer 1320 may befabricated in the form of at least one hardware chip and mounted on anelectronic device. For example, at least one of the data obtainer1320-1, the pre-processor 1320-2, the recognizing data selector 1320-3,the recognition result provider 1320-4, and the model updater 1320-5 maybe fabricated as a dedicated hardware chip for artificial intelligence(AI) or as a portion of a general-purpose processor (e.g., a CPU or anapplication processor) or a graphics dedicated processor (e.g., a GPU)and mounted on the various electronic devices as described above.

Furthermore, the data obtainer 1320-1, the pre-processor 1320-2, therecognizing data selector 1320-3, the recognition result provider1320-4, and the model updater 1320-5 may be mounted on a singleelectronic device or may be respectively mounted on separate electronicdevices. For example, some of the data obtainer 1320-1, thepre-processor 1320-2, the recognizing data selector 1320-3, therecognition result provider 1320-4, and the model updater 1320-5 may beincluded in an electronic device, and the rest may be included in aserver.

Furthermore, at least one of the data obtainer 1320-1, the pre-processor1320-2, the recognizing data selector 1320-3, the recognition resultprovider 1320-4, and the model updater 1320-5 may be implemented as asoftware module. When at least one of the data obtainer 1320-1, thepre-processor 1320-2, the recognizing data selector 1320-3, therecognition result provider 1320-4, and the model updater 1320-5 isimplemented as a software module (or a program module includinginstructions), the software module may be stored in a non-transitorycomputer readable medium. Furthermore, in this case, the at least onesoftware module may be provided by an OS or provided by a certainapplication. Alternatively, some of the at least one software modulesmay be provided by an OS, and the rest of the at least one of softwaremodules may be provided by a certain application.

FIG. 13 is a diagram illustrating an example of learning and recognizingdata as the electronic device 1000 operates in conjunction with theserver 2000, according to an embodiment.

Referring to FIG. 13, the server 2000 may learn criteria regarding howto obtain pixel information and how to generate a notification message.The electronic device 1000 may determine how to obtain pixel informationand how to generate a notification message based on learning results ofthe server 2000. The server may include the functions of the datalearner 1310 described above with reference to FIG. 11, and include, forexample, a data obtainer 2310, a pre-processor 2320, a learning dataselector 2330, a model learner 2340 and a model evaluator 2350, whichcorrespond, for example, to similarly named elements of FIG. 11, andthus a detailed description thereof will not be repeated here.

In this case, a data learner 2300 of the server 2000 may perform thefunction of the data learner 1310 illustrated in FIG. 11. The datalearner 2300 of the server 2000 may learn criteria regarding how toobtain pixel information, which data to use to determine how to generatea notification message, how to obtain the pixel information by usingcertain data, and how to generate the notification message. The datalearner 2300 may obtain data to be used for learning and apply theobtained data to a data recognition model to be described below, therebylearning criteria regarding how to obtain pixel information and how togenerate a notification message.

Furthermore, the recognition result provider 1320-4 of the electronicdevice 1000 may apply data selected by the recognizing data selector1320-3 to a data recognition model generated by the server 2000 todetermine how to obtain pixel information and how to generate anotification message. For example, the recognition result provider1320-4 may transmit data selected by the recognizing data selector1320-3 to the server 2000 and request the server 2000 to apply the dataselected by the recognizing data selector 1320-3 to a data recognitionmodel and determine how to obtain pixel information and how to generatea notification message. Furthermore, the recognition result provider1320-4 may receive from the server 2000 information regarding how toobtain pixel information and how to generate a notification messagedetermined by the server 2000.

The recognition result provider 1320-4 of the electronic device 1000 mayreceive a data recognition model generated by the server 2000 from theserver 2000 and determine how to obtain pixel information and how togenerate a notification message using the received data recognitionmodel. In this case, the recognition result provider 1320-4 of theelectronic device 1000 may apply data selected by the recognizing dataselector 1320-3 to the data recognition model received from the server2000, thereby determining how to obtain pixel information and how togenerate a notification message.

Some embodiments may also be implemented in the form of a recordingmedium including instructions executable by a computer, such as aprogram module, executed by a computer. Computer readable medium may beany available medium that may be accessed by a computer and include bothvolatile and nonvolatile media and both removable and non-removablemedia. Furthermore, computer-readable media may also include computerstorage media. Computer storage media may includes both volatile andnonvolatile media and both removable and non-removable media implementedin any method or technology for storage of information, such as computerreadable instructions, data structures, program modules or other data.

Furthermore, in this specification, the term “-er”, “-or”, or “unit” maybe a hardware component, such as a processor or a circuit, and/or asoftware component executed by the hardware component like a processor.

While the present disclosure has been illustrated and described withreference to various example embodiments thereof, it will be understoodby those of ordinary skill in the art that various changes in form anddetails may be made therein without departing from the spirit and scopeof the present disclosure as defined by the following claims. Hence, itwill be understood that the example embodiments described above aremerely illustrative and do not limit the scope of the presentdisclosure. For example, each component described in a single type maybe executed in a distributed manner, and components describeddistributed may also be executed in an integrated form.

It should be understood that the claims and all modifications ormodified forms drawn from the concept of the claims are included in thescope of the present disclosure.

What is claimed is:
 1. An electronic device configured to control anoperation of a vehicle, the electronic device comprising: a memoryconfigured to store at least one program; and at least one processorconfigured to provide a notification message by executing the at leastone program, wherein the at least one program includes instructions,which when executed by the processor, cause the electronic device toperform at least one operation comprising: obtaining a video sequencecomprising a plurality of frames from a camera installed on the vehicle;detecting, from the plurality of frames, an object included in theplurality of frames via a first learning model; obtaining positioninformation regarding the detected object with respect to each of theplurality of frames via the first learning model; determining whether anevent related to driving of the vehicle has occurred by analyzingtime-series changes in positions of the detected object in the pluralityof frames via a second learning model; generating a notification messageabout the event based on a result of the determining via the secondlearning model; and outputting the generated notification message,wherein an input to the second learning model includes an output of thefirst learning model, the output of the first learning model includingthe position information regarding the detected object with respect toeach of the plurality of frames.
 2. The electronic device of claim 1,wherein the at least one program includes instructions, which whenexecuted by the processor, cause the electronic device to perform atleast one operation comprising the plurality of frames being input intothe first learning model, and the position information output by thefirst learning model includes a group of pixels in each of the pluralityof frames indicating the position of the detected object.
 3. Theelectronic device of claim 1, wherein the determining of whether anevent has occurred comprises: determining a type of the event; anddetermining a risk level of driving, and the generating of thenotification message comprises generating the notification message basedon the type of the event and the risk level of driving.
 4. Theelectronic device of claim 3, wherein a command for controlling anoperation of a module installed on the vehicle is transmitted to themodule installed on the vehicle based on the type of the event and therisk level of driving.
 5. The electronic device of claim 1, wherein theoutputting of the notification message comprises displaying thenotification message on a head-up display (HUD) of the vehicle.
 6. Theelectronic device of claim 1, wherein the obtaining of the positionalinformation regarding the object comprises determining positions of theobject in the plurality of frames on a pixel-by-pixel basis.
 7. Theelectronic device of claim 1, wherein the first learning model isgenerated using a fully convolutional network (FCN), by learningcriteria for determining a type of the object from a plurality ofdifferent types of objects and learning criteria for determiningpositions of the object in the plurality of frames, the type of theobject is determined using the first learning model in the detecting ofthe object, and positions of the object in the plurality of frames aredetermined with respect to the plurality of frames using the firstlearning model in the obtaining of the positional information regardingthe object.
 8. The electronic device of claim 1, wherein the input tothe second learning model is obtained by reducing dimensions of theoutput of the first learning model.
 9. The electronic device of claim 1,wherein the second learning model is generated using a recurrent neuralnetwork (RNN), by learning criteria for determining whether an eventrelated to driving of a vehicle has occurred and learning criteria fordetermining the content of a notification message, and the generating ofthe notification message comprises, determining the content of thenotification message using the second learning model.
 10. The electronicdevice of claim 1, wherein the at least one program further comprisesinstructions for applying, to the plurality of frames, a filter forflattening lightening degrees of the plurality of frames to input theplurality of frames to the first learning model.
 11. A method ofcontrolling an operation of a vehicle, the method comprising: obtaininga video sequence comprising a plurality of frames from a camerainstalled on the vehicle; detecting, from the plurality of frames, anobject included in the plurality of frames via a first learning model;obtaining position information regarding the detected object withrespect to each of the plurality of frames via the first learning model;determining whether an event related to driving of the vehicle hasoccurred by analyzing time-series changes in positions of the detectedobject in the plurality of frames via a second learning model;generating a notification message about the event based on a result ofthe determining via the second learning model; and outputting thegenerated notification message, wherein an input to the second learningmodel includes an output of the first learning model, the output of thefirst learning model including the position information regarding thedetected object with respect to each of the plurality of frames.
 12. Themethod of claim 11, wherein the plurality of frames are input into thefirst learning model, and the position information output by the firstlearning model includes a group of pixels in each of the plurality offrames indicating the position of the detected object.
 13. The method ofclaim 11, wherein the determining of whether an event has occurredcomprises: determining a type of the event; and determining a risk levelof driving, and the generating of the notification message comprisesgenerating the notification message based on the type of the event andthe risk level of driving.
 14. The method of claim 13, wherein a commandfor controlling an operation of a module installed on the vehicle istransmitted to the module installed on the vehicle based on the type ofthe event and the risk level of driving.
 15. The method of claim 11,wherein the obtaining of the positional information regarding the objectcomprises determining positions of the object in the plurality of frameson a pixel-by-pixel basis.
 16. The method of claim 11, wherein the firstlearning model is generated using a fully convolutional network (FCN),by learning criteria for determining a type of the object from aplurality of different types of objects and learning criteria fordetermining positions of the object in the plurality of frames, the typeof the object is determined using the first learning model in thedetecting of the object, and positions of the object in the plurality offrames are determined with respect to the plurality of frames using thefirst learning model in the obtaining of the positional informationregarding the object.
 17. The method of claim 11, wherein the input tothe second learning model is obtained by reducing dimensions of theoutput of the first learning model.
 18. The method of claim 11, whereinthe second learning model is generated using a recurrent neural network(RNN), by learning criteria for determining whether an event related todriving of a vehicle has occurred and learning criteria for determiningthe content of a notification message, and the generating of thenotification message comprises, determining the content of thenotification message using the second learning model.
 19. The method ofclaim 11, further comprising: applying, to the plurality of frames, afilter for flattening lightening degrees of the plurality of frames toinput the plurality of frames to the first learning model.
 20. Anon-transitory computer readable recording medium having recordedthereon a computer program which, when executed by a processor, causesthe processor to: obtain a video sequence comprising a plurality offrames from a camera installed on a vehicle; detect, from the pluralityof frames, an object included in the plurality of frames using a firstlearning model; obtain position information regarding the detectedobject with respect to each of the plurality of frames using the firstlearning model; determine whether an event related to driving of thevehicle has occurred by analyzing time-series changes in positions ofthe detected object in the plurality of frames using a second learningmodel; generate a notification message about the event based on a resultof the determining using the second learning model; and output thegenerated notification message, wherein an input to the second learningmodel includes an output of the first learning model, the output of thefirst learning model including the position information regarding thedetected object with respect to each of the plurality of frames.